#discriminative classifiers
29/04/2025
THINKPRM: Revolutionizing Scalable Reasoning Verification with Generative Process Reward Models
THINKPRM introduces a generative process reward model that significantly improves reasoning verification with minimal supervision, outperforming traditional discriminative models across key benchmarks.